The Best 1347 Image Classification Tools in 2025
Nsfw Image Detection
Apache-2.0
An NSFW image classification model based on the ViT architecture, pre-trained on ImageNet-21k via supervised learning and fine-tuned on 80,000 images to distinguish between normal and NSFW content.
Image Classification
Transformers

N
Falconsai
82.4M
588
Fairface Age Image Detection
Apache-2.0
An image classification model based on Vision Transformer architecture, pre-trained on the ImageNet-21k dataset, suitable for multi-category image classification tasks
Image Classification
Transformers

F
dima806
76.6M
10
Dinov2 Small
Apache-2.0
A small-scale vision Transformer model trained using the DINOv2 method, extracting image features through self-supervised learning
Image Classification
Transformers

D
facebook
5.0M
31
Vit Base Patch16 224
Apache-2.0
Vision Transformer model pre-trained on ImageNet-21k and fine-tuned on ImageNet for image classification tasks
Image Classification
V
google
4.8M
775
Vit Base Patch16 224 In21k
Apache-2.0
A Vision Transformer model pretrained on the ImageNet-21k dataset for image classification tasks.
Image Classification
V
google
2.2M
323
Dinov2 Base
Apache-2.0
Vision Transformer model trained using the DINOv2 method, extracting image features through self-supervised learning
Image Classification
Transformers

D
facebook
1.9M
126
Gender Classification
An image classification model built with PyTorch and HuggingPics for recognizing gender in images
Image Classification
Transformers

G
rizvandwiki
1.8M
48
Vit Base Nsfw Detector
Apache-2.0
An image classification model based on Vision Transformer (ViT) architecture, specifically designed to detect whether images contain NSFW (Not Safe For Work) content.
Image Classification
Transformers

V
AdamCodd
1.2M
47
Vit Hybrid Base Bit 384
Apache-2.0
The Hybrid Vision Transformer (ViT) model combines convolutional networks and Transformer architectures for image classification tasks, excelling on ImageNet.
Image Classification
Transformers

V
google
992.28k
6
Gender Classification 2
This is an image classification model based on the PyTorch framework and generated using HuggingPics tools, specifically designed for gender classification tasks.
Image Classification
Transformers

G
rizvandwiki
906.98k
32
Mobilevit Small
Other
MobileViT is a lightweight, low-latency vision Transformer model that combines the strengths of CNNs and Transformers, making it suitable for mobile devices.
Image Classification
Transformers

M
apple
894.23k
65
Phikon
Other
Phikon is a self-supervised learning model for histopathology based on iBOT training, primarily used for extracting features from histology image patches.
Image Classification
Transformers English

P
owkin
741.63k
30
Vit Tiny Patch16 224
Apache-2.0
ViT-Tiny model converted from the timm repository, suitable for image classification tasks, with usage consistent with the ViT-base model
Image Classification
Transformers

V
WinKawaks
692.49k
21
Dinov2 Large
Apache-2.0
A vision Transformer model trained using the DINOv2 method, extracting robust visual features from massive image data through self-supervised learning
Image Classification
Transformers

D
facebook
558.78k
79
Beit Base Patch16 224 Pt22k Ft22k
Apache-2.0
BEiT is a Vision Transformer (ViT)-based image classification model, pre-trained in a self-supervised manner on ImageNet-22k and fine-tuned on the same dataset.
Image Classification
B
microsoft
546.85k
76
Vit Small Patch16 224
Apache-2.0
ViT-tiny model converted from timm codebase, suitable for image classification tasks
Image Classification
Transformers

V
WinKawaks
447.70k
18
Rad Dino
Other
Vision Transformer model trained with self-supervised DINOv2, specifically designed for encoding chest X-ray images
Image Classification
Transformers

R
microsoft
411.96k
48
Swinv2 Tiny Patch4 Window16 256
Apache-2.0
Swin Transformer v2 is a vision Transformer model that achieves efficient image classification through hierarchical feature maps and local window self-attention mechanisms.
Image Classification
Transformers

S
microsoft
403.69k
5
Swin Base Patch4 Window7 224
Apache-2.0
Swin Transformer is a hierarchical vision transformer based on shifted windows, suitable for image classification tasks.
Image Classification
Transformers

S
microsoft
281.49k
15
Efficientnet B2
Apache-2.0
EfficientNet is a mobile-friendly pure convolutional model that achieves excellent performance in image classification tasks by uniformly scaling depth/width/resolution dimensions with compound coefficients.
Image Classification
Transformers

E
google
276.94k
2
Resnet 50
Apache-2.0
ResNet-50 is a residual network model pre-trained on ImageNet-1k, using the v1.5 architecture improvement, suitable for image classification tasks.
Image Classification
R
microsoft
273.80k
407
Plant Disease Detection Project
Other
MobileNet V2 is a lightweight convolutional neural network designed for mobile devices, achieving a balance between latency, model size, and accuracy.
Image Classification
Transformers

P
Diginsa
242.43k
4
Beit Large Patch16 224
Apache-2.0
BEiT is an image classification model based on Vision Transformer (ViT) architecture, pretrained with self-supervised learning on ImageNet-21k and fine-tuned on ImageNet-1k.
Image Classification
B
microsoft
222.46k
1
Prov Gigapath
Apache-2.0
Prov-GigaPath is a whole-slide foundation model for digital pathology based on real-world data, designed to extract patch-level and slide-level features from pathology slides.
Image Classification
P
prov-gigapath
193.45k
131
Vit Large Patch16 224
Apache-2.0
Large-scale image classification model based on Transformer architecture, pre-trained and fine-tuned on ImageNet-21k and ImageNet-1k datasets
Image Classification
V
google
188.47k
30
Skin Type
An image classification model for categorizing human skin types, committed to fairness to ensure accurate performance across all skin tones.
Image Classification
Transformers

S
driboune
182.21k
3
Vit Large Patch16 384
Apache-2.0
Vision Transformer (ViT) is an image classification model based on the transformer architecture, pre-trained on ImageNet-21k and fine-tuned on ImageNet.
Image Classification
V
google
161.29k
12
Nsfw Image Detection 384
Apache-2.0
Lightweight NSFW image detection model with 98.56% accuracy, size only 1/18-1/20 of similar models
Image Classification
N
Marqo
158.92k
21
Deit Base Patch16 224
Apache-2.0
DeiT is a data-efficient image Transformer model trained with attention mechanisms, pretrained and fine-tuned on the ImageNet-1k dataset at 224x224 resolution.
Image Classification
Transformers

D
facebook
152.63k
13
Resnet 18
Apache-2.0
ResNet model trained on ImageNet-1k, utilizing residual connection structure, supports image classification tasks
Image Classification
Transformers

R
microsoft
133.01k
52
Deepfake Vs Real Image Detection
Apache-2.0
An image classification model based on Vision Transformer architecture, used to detect real images versus AI-generated fake images.
Image Classification
Transformers

D
dima806
129.66k
27
Dino Vitb16
Apache-2.0
A Vision Transformer model trained using the DINO self-supervised method, based on the ViT architecture and pretrained on the ImageNet-1k dataset.
Image Classification
Transformers

D
facebook
122.46k
108
Vit Large Patch14 Reg4 Dinov2.lvd142m
Apache-2.0
A Vision Transformer (ViT) image feature model with registers, pre-trained using self-supervised DINOv2 method on the LVD-142M dataset.
Image Classification
Transformers

V
timm
119.48k
7
Vit Large Patch32 384
Apache-2.0
This Vision Transformer (ViT) model is pre-trained on the ImageNet-21k dataset and then fine-tuned on the ImageNet dataset, suitable for image classification tasks.
Image Classification
V
google
118.37k
16
Dinov2 Giant
Apache-2.0
A vision Transformer model trained using the DINOv2 method for self-supervised image feature extraction
Image Classification
Transformers

D
facebook
117.56k
41
Dino Vits8
Apache-2.0
A Vision Transformer model trained with self-supervised DINO method using 8x8 image patches, suitable for image feature extraction tasks
Image Classification
Transformers

D
facebook
106.97k
12
Swin Tiny Patch4 Window7 224
Apache-2.0
Swin Transformer is a hierarchical vision Transformer that achieves linear computational complexity by computing self-attention within local windows, making it suitable for image classification tasks.
Image Classification
Transformers

S
microsoft
98.00k
42
Pedestrian Gender Recognition
Apache-2.0
This model is an image classification model fine-tuned on the PETA dataset based on the BEiT architecture, used for recognizing pedestrian gender with an accuracy of 91.07%.
Image Classification
Transformers

P
NTQAI
93.78k
15
Vit Large Patch16 224 In21k
Apache-2.0
A Vision Transformer model pretrained on the ImageNet-21k dataset, suitable for image feature extraction and downstream task fine-tuning.
Image Classification
V
google
92.63k
26
Vit Small Patch16 224.dino
Apache-2.0
An image feature model based on Vision Transformer (ViT), trained using the self-supervised DINO method, suitable for image classification and feature extraction tasks.
Image Classification
Transformers

V
timm
70.62k
4
Mobilenet V2 1.0 224
Other
MobileNet V2 is a lightweight vision model optimized for mobile devices, excelling in image classification tasks.
Image Classification
Transformers

M
google
69.47k
29
Nsfw Image Detector
Apache-2.0
A NSFW (Not Safe For Work) content detection model fine-tuned based on Google Vision Transformer, capable of identifying 5 types of image content
Image Classification
Transformers

N
LukeJacob2023
68.26k
17
- 1
- 2
- 3
- 4
- 5
- 6
- 10